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Abstract 

We prove a concise factor-of-two estimate for the failure-rate of optimally dis- 
tinguishing an arbitrary ensemble of mixed quantum states, generalizing work 
of Holevo [Theor. Probab. Appl. 23, 411 (1978)] and Curlander [Ph.D. The- 
sis, MIT, 1979]. A modification of the minimal principle of Concha and Poor 
[Proceedings of the 6th International Conference on Quantum Communication, 
Measurement, and Computing (Rinton, Princeton, NJ, 2003)] is used to derive 
a sub-optimal measurement which has an error rate within a factor of two of 
the optimal by construction. This measurement is quadratically weighted, and 
has appeared as the first iterate of a sequence of measurements proposed by 
Jezek, Rehacek, and Fiurasek [Phys. Rev. A 65, 060301]. Unlike the so-called 
"pretty good" measurement, it coincides with Holevo's asymptotically-optimal 
measurement in the case of non-equiprobable pure states. A quadratically- 
weighted version of the measurement bound by Barnum and Knill [J. Math. 
Phys. 43, 2097 (2002)] is proven. Bounds on the distinguishability of syn- 
dromes in the sense of Schumacher and Westmoreland [Phys. Rev. A 56, 
131 (1997)] appear as a corollary. An appendix relates our bounds to the 
trace-Jensen inequality. 
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I Introduction 



The minimum-error quantum distinguishability problem is of obvious practical im- 
portance in the design of optical detectors and of fundamental importf 
subject of quantum information and quantum computation ji-10]: 



tance m 



If an unknown state pk is randomly chosen from a known ensemble of 
quantum states, what is the chance that the value of k will be discovered 
by an optimal measurement? 

Although various necessary and sufficient conditions for optimal measurements have 
been derived ll-17j (see also 18]), and a number of numerical algorithms for com- 



puting optimal measurements have been implemented |18l-l22l]. it is unlikely that an 
explicit general solution is forthcoming. A number of worksgive interesting general 



upper and/or lower bounds on quantum distinguishability [2j, I23M29 



The theory of optimal measurements has been generalized in several directions, in- 
cluding to Belavkin and Maslov's theory of wave discrimination [3o| and to the theory 
of optimal quantum channel reversals, in the sense of average entanglement fidelity 



24 



31-35 



Success rates of optimal measurements have recently been expressed in 
terms of the conditional min-entropy of bipartite classical-quantum states in Theo- 
rem 1 of [36]. In particular, the problem of finding the conditional min-entropy of an 
arbitrary bipartite quantum state generalizes the optimal distinguishability problem. 



IA Results 



Theorem [9] of section IIVI combines ideas of Holevo [37 
cha & Poor 



39 



Curlander 



and Con- 



41] to give mathematically concise upper and lower distinguishabil- 
ity bounds for arbitrary ensembles of mixed quantum states. Employing an ap- 
proximate minimal principle, a suboptimal measurement is derived which has a 
failure rate within a factor of two of the optimal by construction. This measure- 
ment is observed to be the first iteration in the sequence of measurements of Jezek, 
Rehacek, and Fiurasek. [2(J In the case of pure states this measurement reduces 
to Holevo's asymptotically-optimal measurement 37], which is the quadratically- 



weighted Belavkin square root measurement. \l4. Il5 



42 



Theorem [TU] of section |V] combines ideas of Curlander and Holevo to give some- 
what tighter distinguishability bounds. Furthermore, a quadratically-weighted ver- 
sion of Barnum and Knill's measurement bounds [24| are obtained, as are bounds on 
distinguishability of syndromes in the sense of Schumacher and Westmoreland. 

To motivate our considerations in the case of mixed states, section [Til revisits 
Holevo and Curlander's pure-state bounds. 

Future directions appear in the final section. The appendix relates our bounds 
to the trace- Jensen inequality. 
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II Hole vo- Cur lander pure state distinguishability 
bounds 



Before attempting to distinguish elements of mixed-state ensembles, it is instructive 
to revisit a pure-state bound used by Holevo in his proof of the asymptotic optimality 
Theorem (Theorem [HI below). 



Theorem 1 (Holevo [37]) The ensemble £ m = {(?/>&, Pk)} k =i m of linearly-independent 
pure states has the following minimum- error distinguishability bound: 

PS!"* < holevo, (1) 

where 



1 Holevo = 1 - Tr / JjOfc \lf>k) (V'fcl; 



and Pfaa m is the failure rate of the optimal measurement. 
A two-sided version of Holevo's bound (Tj[|) 

^optimal 



popima ^ [7 Holevo , 27Holevo] , (3) 



was proved using different techniques by Curlander [38| under the additional as- 
sumption of equiprobability (p k = 1/m). Equation (j3J) follows without Curlander's 
restriction by the following trivial modification of Holevo's argument. 

Holevo restricted attention to orthonormal von Neumann measurement bases 
{efe}J3 with phases chosen so that 

(e fc ,Vfc>>0. (4) 
Instead of minimizing the probability of failure 

m 

Pfaii({e fc }) = ^R.(l-|(e fc ,^)| 2 ), (5) 
fc=i 

Holevo considered the tractable approximation 

m 

^Holevo ({ e fc}) = ^Pk HV'fe ~ e k \\ 2 , (6) 



k=l 



which is equation 8 of [37]] . Since the phase condition (j4j) implies that 

1- |(^,e fc )| 2 = l|efc ~/ fc|1 (l + (^ fc , efc )) G [1/2,1] x \\e k -^ k \\\ (7) 



Rank-1 project ive measurements are optimal for distinguishing linearly-independent pure 
states.! 



ank-1 proiecti 
■ 0,0, IIIH 
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one has 

Pm ({e*}) e [1/2, 1] x Cnoievo ({e fc }) , (8) 
where we use the notation [a, b] x c = [ac, be}. The bound ([3]) follows from minimiza- 
tion of Cnoievo • The minimizer is the (usually sub-optimal) measurement basig^] 

e Holevo = |^ } ^y 1 ' 2 ^ |^) . (9) 

III Definitions, background, and notation: 

In this section we collect the technical definitions and mathematical background 
needed for the rest of the paper. Throughout we shall consider an ensemble 

€m = {(Pk,Pk)}k=l,..., m ( 10 ) 

of quantum states p k on a Hilbert space TC with a-priori probabilities p k , with ^^Pfc = 
1 and Tr pk = 1. One may take m = oo without changing our results. For the special 
case of pure states, pk will be denoted by pk = \ipk) (i>k\- 

Definition 2 The ensemble £ m is equiprobable ifpk = 1/m for all k. The subspace 
Span (S m ) is the span of the ensemble 8 m , i.e. the span of the ranges of the pkPk- 
A positive-operator valued measure (POVM) (see, for example, p. 74 off]]]) 
for distinguishing £ m is a collection of positive semidefinite operators {Mk\ k=1 m 
such that ^2 Mk = 1-n or ispan(£ , m ) ■ The probability that the value % is detected when 
the POVM is applied to the state pj is given by p^j = TrMipj. In particular, the 
success rate for the POVM to distinguish the ensemble S m is given by 

m 

Psucc (M fc ) = ^p k Tr (p k M k ) = 1 - P faU . (11) 
k=i 

The optimal success rate is 

P^r l = sup P succ (M fc ) = 1 — Pfati imal - (12) 

POVMs {M k } 

A common POVM is 



Definition 3 The Belavkin-Hausladen-Wootters "pretty good measurement" 
(PGM@ is given by 

/ m \ -V2+ / m \ -V2+ 

M k = f ^jpePe J PkPk [y^PePej , (13) 



2 The final section of Holevo's paper contains minor algebra errors. A corrected version of the 
minimizer ([9]) appears in [iBj ]. which also removes Holevo's assumption of linear-independence. 
(This generalization may also be accomplished simply by using Naimark's theorem, as advocated 



by Kebo [46f| . ) Holevo's measurement ([9 



belonged to the previously-considered class of Belavkin 



weighted square root measurements. [14, Il5|. also called "weighted least-squares measurements 



[45j and "generalized 'pretty good' measurements" [44 1. 

3 The PGM for non-equiprobable pure states appeared in 1975 as an optimal measurement under 
conditions of equality along the diagonal of the Graham matrix 3, Hj], and reappeared in 1993 as 
an approximately-optimal measurement 47, 48}. 
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where one defines 

^ /2+ = £ A 7 1/2 i^>^i' (14) 

A,>0 

for a spectral decomposition A = ^^Aj \ (ipj\- 



Numerical evidence [20l. l2ll] suggests that the following sequence of measurements 
converges to the optimal measurement: 



Definition 4 The Jezek-Rehdcek-Fiurdsek iterative measurements { M k 



(«) 



n G Z + , are recursively defined by I2(i . \2i l 

= 1/m for m < oo, = 1 for m = oo (15) 

/ m \ -1/2+ / m \ -1/2+ 

Ml n) = YjftpMt^Pi vlpuMt^Pu Y,P^ M t 1] Pi ■ (16) 



k=l,...,m 



If S m is a pure-state ensemble then Holevo 's measurement iSn l is given by 

M k = \e% olevo ) (e£ olevo \ , (17) 

where ej? olevo is given by @. In particular, Holevo 's measurement is the pure-state 
version of the first Jezek-Rehdcek-Fiurdsek iterate. 



Remark: M\, is also a POVM for m < oo. Note that since the recursion formula 



r(o) 

(|T6j) is invariant under rescalings M^ 1 ^ — > A x M^ n x \ the cases m < oo and m = oo 
are essentially the same for n > 0. 

Holevo studied measurements which were asymptotically optimal in the following 
precise sense: 

Definition 5 A measurement procedure G is a mapping from ensembles to corre- 
sponding POVMs. It is asymptotically optimal \3n] for distinguishing pure-state 
ensembles if for fixed pi, ...,p m one has 



p G(£ m ) / c 

r fail \°r\ 



,opt/^ , 1 ( 18 ) 

fail 1 

as the states ipk of £ m approach an orthonormal setQ 
Holevo showed that 



4 It is presumably intractable to produce a closed-form measurement process G for which 
Pfaii^"* (£m) /-f > feji timal (£m) ^ 1 as the fa and pk are arbitrarily varied in such a way that 
Pfel\ lm {£m) — * 0. Otherwise, one could recover the optimal measurement for a fixed ensem- 
ble £ m on H by taking the A — > 1~ limit of the ensemble £' m+1 = {{ipk, (1 — typk)} U {(<fi, A)} on a 
dilation W D H, with 4>±.H. 
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Theorem 6 (Holevo's asymptotic-optimality Theorem (1977) [37]]) Holevo's 

s 

(19) 



measurement ([Pj) is asymptotically optimal. Furthermore, for fixed {pk} one has 

2T Holevo 



Pfail (£rn) 

as -> S i:j , where -f H oievo is given by ([ 



A converse was proven in 42 



The following norms will be used: 

Definition 7 Let 7i and K, be Hilbert spaces, and let A : 7i — > K, be a bounded linear 
operator. The absolute value is \A\ = V A^A. The trace norm is [|^4.|| x = Tr \A\. 
The Frobenius norm is \\A\\ 2 = yTiA^A. The operator norm is given by 

\\M\ = sup 1M. (20) 



A is an isometry if A^A = 1. 

It will be assumed that the reader is familiar with the following properties of the 



trace-norm, which may be found in [49 



1. \TtA\ < \\AWi = \\A\ 

2. \\WAWi < \\W\\ x HAIIj 

3. If dim/C > dim7i then 



sup Re (TrA^U) = \\A\h, (21) 

isomctries U:Ti.—*K 

where U is a maximizer iff 

^l Ra n(AU)=^(^) _1/2+ - (22) 

Note: Property 3 is a simple consequence of the singular-value decomposition. 
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IV Mixed-state distinguishability bounds using Holevo's 
method 

The first step in constructing a mixed-state version of the argument of section [III is 
to construct a mixed-state version of the underlying estimate ([7j): 

Lemma 8 Let p be a density matrix on Ti and let E : 7i — > 7i be an operator with 
\\E\\ < 1. Then 

1-Tt[e'e P ) G[l,2]x(l-||£/4) (23) 

Proof. The lower bound follows from the properties of the trace-norm: 

1 - Tr {E ] Ep) > 1 - WE^Ep^ > 1 - \\E\\ x \\Ep\\ x > 1 - \\Ep\\ x 

To prove the upper bound, define the pre-inner product on the bounded operators 
on TC by 



(E, F) p = Tr (tfFp) . 



H 



By Bessel's inequality 



Tr (E ] Ep) = \\E\\ 2 > sup 



(U,E) t 



sup ■ 



\Tr WEp\ 



U unitary \\U \\ p (Tip) 

Subtracting both sides from 1, 

1 - Tr (e'Ep) < (1 + \\Ep\\ x ) (1 - \\Ep\\ x ) < 2 (1 - \\Ep\\ x ) . 



\\Ep\\l- 



To find the measurement properly analogous to (Q , one simply needs to minimize 
the cost function arising from (|23|) : 

Theorem 9 Let M k = E ] k E k be a POVM on Span (S m ) minimizing the approximate 
cost function 

C{{E k })=YjPkO—\\E h p k \\ 1 ). (24) 



Then M k is the first Jezek-Rehdcek-Fiurdsek iterate ( 1 1 6j) 



,2 2 



-1/2+ 



and 
where 



T<P$ mal <P fa u({M k })<2T, 



T = T (£ m ) = min C ({E k }) = 1 - Tr 



\ fe=i 



(25) 
(26) 

(27) 



Remark: The approximate cost function (1241) is a somewhat-disguised modification 
of the minimal principle of Concha and Poor 39M41|. which was reverse-engineered 
to reproduce the ad-hoc mixed-state PGM. The measurement ( 12 5 p is an example of a 
mixed-state Belavkin weighted measurement. (See section 2.2 of 30j.) A discussion 



of the relative merits of various weightings, including the cubic weighting of |50j, [51 



may be found in [42 



Proof. By lemmaEl Pm UeIeA) G [1, 2] x C ({E k }) for all POVMs M k = E ] k E k . 

Hence all that is required to get a factor-of-two estimate of (£ m ) is to minimize 

C subject to the constraint Y^E^i^ = lspan(£ m )- Note that the replacement E k — > 

W k E k for unitary W k does not alter C ({E k }) or the quantities E\E k . Hence the 
polar decomposition allows imposition of the additional constraint E k p k > 0, giving 
the expression 

C({E k }) = C(U) = l-TrV^U, 
where U, V : Span (£ m ) — > Span (£ m ) <8> C m are defined by 

m 

Uip = ^2 \E k ifi) ® \k) £m 
k=i 

m 

V J I> = J~] \PkPk4>) ® \k) C n ■ 



k = l 



Here \k) Cm is the standard basis of C m . Note that U is an isometry iff M k 
is a POVM on Span (£ m ). By equation! 



E\E k 



min C(I7) = 1 - \\V\lj_ = 1 - Tr 

isometries U 



\ k=l 



with minimizer 
This gives 



u = v (yV) 



-1/2+ 



JTJ mm 



1 i>l. l ) l. | XI P t P t 



Since E™ n p k > 0, the theorem follows. 
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V Generalization of Curlander's upper bound 

The upper bound of (|26|) may be sharpened by combining Holevo's measurement (jUJ) 



with Curlander's argument of Ref. [38 



Theorem 10 The optimal failure rate for distinguishing the arbitrary mixed-state 
ensemble S m = {(pk,Pk)} k =i,..., m satis fi es 

r < PjS < Pfjr < r (2 - r) < 2T, (28) 

where Pflii RF is the failure rate of the measurement (Tf5j) and T = T (£ m ) is given by 
{21). Furthermore, 

P$ < PSf " < (1 + P%L) P&- (29) 



Note: Curlander proved ( l28l) in the special case of equiprobable pure states. [3i 
Barnum and Knill have already shown that the bound ( |29i) holds for the mixed-state 



"pretty good" measurement 24j. Note that the RHS of ( |29l) never exceeds 1, so the 
bound is always meaningful. 

NOTE ADDED TO ARXIV VERSION: It was not realized at the time of 
publication that the lower bound of (l28~l) admits a generalization using the theory of 



matrix monotonicity [54J|. Furthermore, this generalization is a minor variation of a 
similar bound of 55], n 



Proof. First restrict consideration to pure-state ensembles £ m = {(V'fc,Pfc)}fc = i 
By the convexity of x > x 2 and Jensen's inequalityj§ 



« F = £^W° leV °'^> 2 (30) 



> (E^< e " 0leV0 '^> 



2 



-1/2 



^JPk {tpk\ ^P 2 i \tpt) (tpe\ Pk \ipk) 



;i-r) 



2 



so that 



^ RF <i-(i-r) 2 = r(2-r). 

The left-most inequality of ([2"8l was already proved in Theorem [91 

In the more general case of mixed states, take spectral decompositions p& = 
J^ePke \i>ke) (V'/wl an d consider the pure-state ensemble 

S m = Wkt),Pkfikt)}' ( 31 ) 



5 An erratum or comment will be sent to JMP to this effect. 

6 The author's argument, which is similar to Curlander's, was originally movtivated by that used 
to prove Lemma 2 of [251 ]. 
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Note that any measurement {M^e} for £^ may be converted into a measurement 
Mfc = ^ £ Mfc£ for £ m , which trivially satisfies 

Pfaii {{M u }) > Pm {{M k }) . 

In particular, £^ is less distinguishable than £ m , and the measurement (1251) is less 
successful at distinguishing it. Then using (|26|) and the pure-state case, 

T (£m) < (Sm) < P f ^ RF (Sm) < P£T (£*J < T (S m ) (2 - V (S m )) . (32) 

Note that last inequality used the identity V (£^J = T (£ m ). 

Note that T e [0, 1) by (|27|) . Because 7 1— > 7 (2 — 7) is monotonic increasing on 
7 G [0, 1), the chain of inequalities (12"9]1 follows by plugging in the left-hand-side of 
the first inequality of (I28p into the right-hand-side of the third. ■ 
Remark: In Schumacher and Westmoreland's classic paper [3j, the elements of £ m 
appear as "codewords," with "syndromes" given by elements of £^. Schumacher and 
Westmoreland assert that measurements of £^ are "not really more difficult" than 
measurements of £ m . It is now easy to quantify this assertion: 



Corollary 11 Let £^ be the ensemble (BZj) of eigenvectors of the elements of £ m . 
Then 

P$ < Pf£ (O < (1 + (£m)) P£ (£m) ■ 

Proof. Simply replace the quantity -P f aii RF (£m) by P°^ (£m) m ^ ne chain of inequal- 
ities (1321) . and continue as in the proof of (1281) . ■ 



VI Reflections on the quadratic weighting 

As we have seen, the quadratic weighting gives rise to some particularly simple 
bounds for distinguishability of quantum states. For comparison, substituting the 
linearly-weighted "pretty good" measurement (fT3j) into equation (130!) gives the upper 
bound 

< 1 - Tr ( ^> iv*> ' {^vT m (V* 

The relative simplicity of the quadratic bound (1281 is not surprising. As shown by 
the author in 42j , Holevo's pure-state measurement has the following conceptual 



and practical advantages over the ad-hoc "pretty good" measurement Q 

1. Holevo's asymptotic-optimality property uniquely specifies Holevo's measure- 
ment among the class of Belavkin weighted measurements. 

2. Holevo's measurement categorically outperforms the PGM for ensembles of two 
pure states. 



7 It is of course assumed that the a-priori probabilities pk are not all the same, or Holevo's 
pure-state measurement and the PGM would be identical. 
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3. The optimality conditions for Holevo's measurement are particularly simple. 



The previous sections provide more examples of this theme: 

4. The quadratically-weighted mixed-state measurement gives particularly simple 
pure- and mixed-state distinguishability bounds. 



5 The approximate cost function ff24|) for the quadratic measurement is within a 
factor of two of the function Pf ai i ( Mb )- (The corresponding cost functions for 



the pure and mixed-states PGMs -Y.) 1 1 ll. |45| admit no such comparison. 



VII Conclusion and Future Directions 

As we have seen, mathematically concise (and reasonably tight) bounds on the dis- 
tinguishability of mixed quantum states may be obtained by combining the ideas of 
Holevo, Curlander, and Concha & Poor. In the above we have not explained the 
connection between these ideas and the iterative algorithm of Jezek, Rehacek, and 
Fiurasek, other than to recognize that a natural generalization of Holevo's argument 
gives the first iterate of Jezek et a/'s measurements. 

A proper setting to explore such questions is in the theory of approximate quan- 



tum channel reversals, which Barnum and Knill 24| have already investigated using 
a generalization of the "pretty good" measurement. We will consider an abstract 
form of JRF iteration, study its convergence properties, and construct bounds on 



channel reversibility and relative min-entropy in future work [52j, [56| . We will also 



attempt to reconsider Holevo's notion of asymptotic optimality in this setting. 
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a copy of Hausladen's thesis, and Stephanie Wehner for a valuable discussion. I would 
also like to thank Arthur Jaffe, Chris King, and Peter Shor for their encouragement 
and the editors and anonymous referees for their useful comments and suggestions. 



Appendix A: An application of the trace-Jensen in- 
equality 

The following theorem makes it transparent that 1 — Tr v/J^pIp! > 0, giving some 
insight into the bounds fl26|) and (128]) : 

Theorem 12 Let f : [0, oo) —>■ M. be concave with f (0) = ; and consider positive 
semidefinite operators Aj. on a Hilbert space 7i. Then 

(N \ N 

J>J <TrJ2f( A k), 
k=l J k=l 
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where f (A) is defined using the functional calculus (I n particular, f (A) 

(Aj) for any spectral decomposition A = ^^Aj \ipi) (ipi\.) 



Proof. The case N = 2 is sufficient. By the trace- Jensen inequality [53 

Tr / (A) = Trf (e\ (A + B) E x + £ 2 f 0£ 2 ) 

> Tr (Elf (A + B) E 1 + Elf(0)E 2 



Tr (A 1/2 (A + By 1/2+ f(A + B)(A + B)~ 1/2+ A 1/2 ^j (Al) 



where 



E\ = A 1/2 (A + B) 



-1/2+ 



e 2 = Ji-e\e 1 . 



Similarly, 



Tr / (B) > Tr (b 1 ' 2 (A + B)~ 1/2+ f(A + B)(A + By 1/2+ B 1 ^ . (A2) 



The conclusion follows by adding (IA1I) and fl A2[) and applying the cyclicity of the 
trace. ■ 

Note added in proof: The lower bound r (£) < Pf a ;i (M£ pt ) of Theorem 10 admits 
a simple generalization proved using matrix monotonicity: 



1 - Tr 



for any s G [1, oo). This is addressed in a short note which has been submitted to 
this journal. [541] 
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